Opening up the Quantum Three-Box Problem with Undetectable Measurements 
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One of the most striking features of quantum mechanics is the profound effect exerted by mea- 
surements aloneP^. Sophisticated quantum control is now available in several experimental sys- 
tem^^'^, exposing discrepancies between quantum and classical mechanics whenever measurement 
induces disturbance of the interrogated system. In practice, such discrepancies may frequently be 
explained as the back-action required by quantum mechanics adding quantum noise to a classical 
signaP. Here we implement the 'three-box' quantum game of Aharonov and Vaidman* in which 
quantum measurements add no detectable noise to a classical signal, by utilising state-of-the-art 
control and measurement of the nitrogen vacancy centre in diamoncP'^. Quantum and classical me- 
chanics then make contradictory predictions for the same experimental procedure, however classical 
observers cannot invoke measurement-induced disturbance to explain this discrepancy. We quantify 
the residual disturbance of our measurements and obtain data that rule out any classical model 
by > 7.8 standard deviations, allowing us for the first time to exclude the property of macroscopic 
state-definiteness from our sy stem. Our experiment is then equivalent to a Kochen-Spekker test of 
quantum non-contextualitjl^ that successfully addresses the measurement detectability loopholj^. 



Classical physics describes the nature of systems that 
are 'large' enough to be considered as occupying one 
definite state in an available state space at any given 
time^^. Macrorealism (MR) then applies whenever it 
is possible to perform non-disturbing measurements 
to identify this state without significantly modifying 
the system's subsequent behavioui'^. Macrorealism al- 
lows the assignment of a definite history (or probabili- 
ties over histories) to classical systems of interest, but 
the MR condition can break down for systems 'small' 
enough to be quantum mechanical (QM) during times 
'short' enough to be quantum coherent — times and 
distances that now exceed secondJ^^ and millimeter^ 
in the solid state. How can we tell whether a particular 
case is better described by QM or MR? If there is a 
cross-over between these, what does it represent? 

One explanation for the breakdown of MR is that 
measurement back-action (either deliberate measure- 
ments by an experimenter or effective measurements 
from the environment) unavoidably change the state in 
the quantum limit, excluding macrorealism due to a 
breakdown of non-disturbing measurability. This posi- 
tion is supported by 'weak value' experiment^^^E^ that 
explore the transition from quantum to classical be- 
haviour as a measurement coupling is varied; Quantum 
behaviour is found under weak coupling, whilst MR- 
compatible behaviour is recovered when strong projec- 
tive measurements effectively 'impose' a classical value 
onto the measured quantum systerrP^. 

We examine a case in which the back-actions of se- 
quential 'strong' projective measurements impose new 
quantum states that provide no detectable indication 
of disturbance to a 'macrorealist' observer. We show 
these state are still incompatible with macrorealism, 
however, as no possible MR-compatible history can be 
assigned to the process as a whole. Our experiment 
can be described as a game played by two opponents 
(Alice and Bob) who take alternate turns to measure 



a shared system. The system they share may or may 
not obey the axioms of macrorealism. For the purposes 
of the game. Bob assumes he may rely on the MR as- 
sumptions being true, and only Alice is permitted to 
manipulate the system between measurements. If Bob 
is correct to assume MR holds, the game they play is 
constructed in his favour, yet 'paradoxically' the exact 
same sequence of operations will define a game that 
favours Alice when a quantum-coherent description of 
the system is valicP^. 

Experimentally, we use the ^''NV^ centre {S = 1, 
1 = 1) in diamond as Alice and Bob's shared system, 
enabling us for the first time to maintain near-perfect 
undetectability of Bob's observations. The e xperim ent 
involves a three level pre- and post-selectiorP^'i^ that 
is known to be equivalent to a Kochen-Spekker test of 
quantum non-contextualitjffil_ Such tests are only pos- 
sible in d > 3 Hilbert space^^^, and recent advances in 
the engineering^ and controP of the NV system enable 
the multiple projective non-demolition measurements 
that are crucial to observing Alice's quantum advan- 
tage in the lab. We describe the gam^ and Bob's ver- 
ification of it from the MR perspective, then discuss 
the experiment and results from the QM position. We 
quantify the incompatibility of our results with macro- 
realism through use of a Leggett-Garg inequalitjC^ and 
discuss the implications of our result. 

In the 'three-box' quantum gam^, Alice and Bob 
each inspect a freshly prepared three state system (clas- 
sically, three separate boxes, hiding one ball) using an 
apparatus that answers the question: "Is the system 
now in state j?" ("Is the ball in box j'?") for j = 1, 2, 3 
by responding either true (1) or false (0). The ques- 
tion is answered by performing one of three mutually 
orthogonal measurements Mj. The game allows Bob a 
single use of either Mi or M2- Alice is allowed only to 
use M3, and additionally to manipulate the system. Al- 
ice is allowed one turn (a manipulation either before or 



2 



b) 



Electron^ C 

Spin ;v'i ^ 

Solid Immersion Lens 




"Shelving" "Readout" "Game" 



Im^.m,) 
1-1.0) 



• 1-1-1) 12) 

-■■|-1,+ 1>-Il> 
QM States 




fiA "Bright" State 
'^'j <n>=10.4 
P(n>l) = 92% 
t,„=50MS 



10 20 30 40 50 



A/) "Dark"state 
'^'i <n>=0.08 
P{n=0) = 98% 
t ,o = 50ms 



Classical 'Three Box' States 



AFrequency (MHz) 



, 5 10 
# photons, n 



FIG. 1. The 'three-box' game is implemented using the N nuclear spin of the NV centre in diamond, 
measured using the electron spin, a) Schematic of the NV~ defect in diamond and representative diamond lens used 
in the measurements, b) The magnetic moment of the electron spin is quantised into one of three values, ms = —1, 0, +1. 
These states split into a further three (mj — — 1,0, +1) according to magnetic moment of the '^^N nuclear spin. The 
ms = ±1 states fluoresce via the Ai transition, whilst ms ~ fluoresces via the E^, transition. We use the ms = — 1 
manifold to hold the three states in the game, conditionally moving the state between ms = — 1 and ms ~ dependent 
on the nuclear spin sublevel mi. These three mj states are taken to correspond to the configurations of a hidden ball, c) 
We identify the allowed microwave transitions (Ams = 1, Am/ = 0), that provide the Mj readouts, d) Photon counting 
statistics, in each case from 10,000 trials, observed during a typical projective read-out indicate the presence (top) or absence 
(bottom) of optical fluorescence, corresponding to outcomes Mj and ^Mj respectively. 



after an M3 measurement) before Bob to prepare the 
system, and one turn following him. Alice attempts 
to guess Bob's measurement resuh, and the pair bet 
on Alice correctly answering: "Did Bob find Mj to be 
true?". Alice offers Bob ^ 50% odds to predict when 
his Mj was true, although she may 'pass' on any given 
round at no cost when she is uncertain. 

Bob realises that if the Mj measurements are per- 
formed on a system following MR axioms, Alice must 
bet incorrectly > 50% of the time, even if Alice could 
'cheat' by knowing which value will be presented 
(classically, knowing which box contains the ball); yet 
with three boxes and his free choice between Mi or 
M2, Alice is prevented from using such knowledge to 
win with > 50% success rate. Bob expects to win if the 
Mj measurements reproduce the behaviour of a ball 
hidden in one of three boxes. The conditions for this 
are: a) The Mj measurements are repeatable and mu- 
tually exclusive, so that Mj A Mk = Sjk] (classically, 
the ball does not move when measured); b) for any trial 
Afi VM2VM3 = 1 (there is only one ball, it is definitely 
in one of the boxes); and c) Bob has equal probabil- 
ity to find each j-value when measuring a fresh state, 
with Pm,(S) = 1/3 j e 1,2,3. (The baU is placed at 
random). The conditions a) - c) serve to prevent Alice 
from learning Bob's Mj result in any macroreal system. 
Before accepting Alice's invitation to play. Bob verifies 
properties a) - c) hold experimentally, by carrying out 
Mj measurements. During verification, the game rules 
are relaxed and Bob is permitted to make pairs of se- 
quential measurements, checking Mj V Mj. = Sjk', He is 
also allowed to measure every Mj including M3, which 
will be reserved for Alice once betting commences. 

When Bob is satisfied that a) - c) hold, the game 
appears fair from his macrorealist standpoint. Bob 
accepts Alice's wager and play commences with Alice 
preparing a state, which Bob measures using either Mi 
or M2, whilst keeping his j-choice and Mj-result secret. 
Alice manipulates the system, uses her M3 measure- 
ment and bets whenever her M3-result is true. Believ- 
ing that Alice could only guess his result. Bob accepts 



Alice's wager. Doing so, he finds Alice's probability 
of obtaining a true M3-result is Pm3(^) — 1/9, in- 
dependent of his j-choice. Under macrorealism, Bob 
could account for this only through Alice using a non- 
deterministic manipulation that would reduce the infor- 
mation available to her from the M3-result. To Bob's 
surprise, when Alice plays, her M3-true results coin- 
cide with rounds on which Bob's Mj-result was also 
true. She passes whenever Bob's M^-result was false. 
In a perfect experiment, she would win every round she 
chose to play; even in a practical experiment she can 
achieve significantly more than the 50% success rate 
which would be predicted by MR. To understand Al- 
ice's advantage we must examine the game from a QM 
perspective. 

Alice uses the initial M3 measurement to obtain the 
pure quantum state |3). She applies the unitary Ui that 
operates as Ui — \T)(^\^ (^orthogonal terms), to produce 
the initial state 



Her operation presents the state |/) to Bob, who next 
measures Mj on |/), performing a projection. If Bob's 
Mj-result is true, he has applied the quantum projector 
Pj = \j){j\ while by finding an A/j-result that is false, 
he has applied Pj- = 1 — Alice would then like 

to measure the component of the state left by Bob's 
measurement along the final state \F) — (|1) + |2) — 
|3))/\/3. Bob's projectors on Alice's initial and final 
states |/) and \F) obey: 

|(F|A|/)P = |(F|P2|/)P = l/9 (2) 
|(F|A^|/)P = |(F|P2^|/)P=0 (3) 

for both j = 1 and j — 2. Alice cannot directly mea- 
sure \F), but is able to transform state \F) into state 
1 3) with a unitary Up — |3)(P| -I- [orthogo'nal terms), 
and then use her measurement of A/3 as an effec- 
tive Mp measurement. Alice therefore obtains M^- 
true when Bob's Mj-result is true with probability 
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PMMnB) = |(3|;7fP,;7i|3)|2 = = 1/9, 

and when Bob's Afj-result is false PM^iA D ^B) = 
|(3|t/FPj^?7i|3)|2 = \{F\P^^\I)\'^ = 0. Alice finding her 
Ms-result true is conditional on Bob leaving a compo- 
nent of \ipj) along to do so, his Af^ -result cannot 
have been false. Alice's probability conditioned on Bob 
is then Pmj{B\A) = 1. Alice bets whenever her Ma- 
result is true, playing Vsj of the rounds and winning 
each round she plays. 

Our implementation of this game uses the nitrogen- 
vacancy centre (NV^), which hosts an excellent three- 
level quantum system for implementing this 'three-box' 
game: the ^''N nucleus which has {21 + 1) = 3 quan- 
tum states (see Fig. [T^). Although we cannot (yet) su- 
perpose a physical ball under three separate boxes, by 
using radio frequency (RF) pulseJ^^l we can readily pre- 
pare the "'^'^N angular momentum into a superposition 
of alignment along three distinct spatial axes, providing 
three mutually exclusive 'box states' in the macrorealist 
picture. We work in the electron spin ms = — 1 mani- 
fold, and assign eigenvalues of nitrogen nuclear spin m/ 
to the box-states j according to: a) |m/ = — 1) ^ b = 
b) \mi = +1) ^ \j = 2) c) \mi - 0) - \j = 3) (see 
Fig.[l>). 

Preparation and readout of the ^''N nuclear spin is 
provided via the NV~ electronic spin (5 = 1). We use 
selective microwave pulses to change ms conditioned 
on m/, and then read out the electron spin in a sin- 
gle shot and with high fidelitj'^, by exploiting the NV 
centre's electron spin-selective optical transitions. The 
spin read-out achieves 96% fidelity and takes « 20 fis, 
much shorter than the nuclear spin inhomogeneous co- 
herence lifetime of Tj* ^ 1ms at T = 8.7 K, enabling 
three sequential read-out operations during a single 
coherent evolution of the system, as required for our 
'three box' implementation. We achieve all steps of the 
quantum experiment well within the coherence time of 
our system and therefore make no use of refocussing 
pulses. 

The full experimental sequence is shown in Fig. [2j 
with further details in the Supplementary online in- 
formation. The initial state |3) is prepared by pro- 
jective nuclear spin readout utilising a short duration 
(~ 200 ns) optical excitation. The subsequent experi- 
ment is then conditioned on detection of at least one 
photon during the preparation phase, which heralds 
|3) with > 95% fidelity (Fig. [iji) at the expense of 
< 1% preparation success rate. Once |3) is her- 
alded, all subsequent data is accepted unconditionally 
for the Leggett-Garg test. After initialisation, Alice 
transforms the state |3) into |/) via two RF pulses (see 
Supplementary online information) and hands the sys- 
tem to Bob. 

Bob picks a secret j-value and maps the correspond- 
ing nuclear spin projection to the electron spin by ap- 
plying a microwave 7r-pulse to drive a transition from 
one of the ms = — 1 states is |1) or |2)) into the 
ms=0 manifold. He then uses optical measurement of 
the Ex fluorescence to determine ms- Absence of fluo- 
rescence (an 'E^j-dark' NV~) implies ^Mj and collapses 
the electron state into ms — — l whilst performing P^- 



on the nuclear spin. (Fig. |3^.ii). We find that nuclear 
spin coherences within 7715=— 1 are unaffected by the 
-^Mj read-out process. 

Detection of n > 1 photons during Bob's 20 /is read- 
out projects into ms — and corresponds to an Mj- 
result true. In such events, there is ~ 70% chance the 
electronic spin will be left in an incoherent mixture of 
ms = ±1 following readout, due to optical pumping. 
Conditionally on Bob's Afj-result being true, we take 
care to undo the mixing effect of this mixing as follows: 
We first pump the electron spin to ms — by selective 
optical excitation of ms = ±1 (Ai light), followed by 
driving a selective a microwave pulse from ms = to 
ms = —l (Fig. [ij;). This procedure is effective, because 
the optical fluorescence preserves the nuclear spin pop- 
ulations TO/ that encode the game eigenstates in > 70% 
of cases (see Fig. [sja) . Bob performs repeated pairs of 
measurements, verifying that from a macrorealist 's per- 
spective, that performing Mj is equivalent to opening 
one of three boxes containing a hidden ball. Bob finds 
the probability for each Mj is ~ 1/3 (Fig. |3^.i). Bob 
performs consecutive Mj observations and verifies that 
finding Mj {-^Mj) true on one run implies that a sub- 
sequent measurement of Mj {-^Mj) will also be true 
(Fig. ^jp,c) gathering statistics over N = 1200 trials for 
each combination. 

Once Bob has measured in secret, Alice predicts his 
result by mapping \F) to |3) and performing M^. Alice 
accomplishes this via: \F) — > |/) — > |3). The Berry's 
phase associated with 27r rotation^^Sl provides the map 
\F) — >■ |/) via two RF pulses that change the signs of 
the {|1), |3)} and then {|2), |3)} states. State |3) then 
acquires two sign changes yielding \F) up to a global 
phase. The map UJ^ from \I) to |3) is then achieved 
by inverting the order and phase of Alice's initial C/j 
pulses. (See Supplementary online information). 

Alice and Bob then compare their measurement re- 
sults during N = 2 * 1200 rounds of play, distributed 
evenly across Bob's two choices of Mj measurement. 
Alice finds her final Ma-result is true in ~15% of cases, 
independent of Bob's choice of measurement context 
between Mi and M2 (Fig. |4^). Amongst those ~15% 
of cases where Alice's Ma-result is true and she chooses 
to bet. Bob finds she wins >67% of such rounds, for ei- 
ther of Bob's choices between measuring Mi and M2 
(Fig. |4j3), confounding the macrorealist expectation. 
The principle source of error in our experiment arises 
from imperfect control of the nuclear spin, (see Sup- 
plementary online information). 

To quantify the apparent incompatibility with 
macrorealism, we construct a Leggett-Garg function for 
our system, defined as: 

(K) = {Q1Q2) + (Q2Q3) + (Q1Q3) (4) 

where Qj are observables of our system recorded at 
three different times, derived from Alice and Bob's mea- 
surementJ^. We assign Qj = +1 whenever an M^- 
result is true (or could be inferred true in the MR pic- 
ture) and assign Qj = —1 otherwise. The initially her- 
alded state |3) fixes the value of Qi — +1 always, and 
values for Q2 and Qa are taken directly from Bob and 
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FIG. 2. Microwave, rf and optical pulse sequence implementing the 'three-box' experiment, i) Initialisation 
consists of preparing the NV~ state via charge-state verification (CSV) and measurement-based initialisation (MBI) into 
state 1 3) followed by purification of — and application of [//. ii) Bob's measurement Mj consists of: Moving population 
from ms = —1 to ms = conditioned on mj, indicated 7r[m/] in the figure, followed by monitoring of E^^ fluorescence. If 
fluorescence is observed, a 'repopulating' sequence via the spectrally resolved Ai fluorescence (Aai — A_b^ = 5 = 1.89 pm) 
resets ms = —1 whilst leaving mj unchanged and ready for Alice's measurement, iii) Alice's measurement consists of the 
unitary Uf, followed by read-out of Ms in the ms = — 1 and ms = +1 sublevels. Further details on the experimental 
sequence are provided in the Supplementary online information. 



Alice's measurement results. The Leggett-Garg func- 
tion satisfies -1 < {K) < +3 for all MR systemJl^l^ 
and for the present system, we can show (K) is related 
to Bob and Alice's statistics as (see Supplementary on- 
line information): 

(K) - ^ (1 - PmAB\A) - PmAB\A)) - 1 (5) 

where PMj{B\A) is the probability that Bob finds the 
Mj -result true, given that Alice has also found her final 
Ma-result true. Macrorealism asserts that Mi and M2 
are mutually exclusive events, whereas QM does not, 
so that: 

MR : Pm, iB\A) + Pm, iB\A) < 1 (6) 
QM : Pm, iB\A) + Pm, {B\A) < 2 (7) 

Under QM assumptions, eqn. [s] satisfies (K) > 
— 13/9 = —1.44, possibly lying outside the range com- 
patible with MR. We determined (A') by estimating 
Pm {B\A) during N ~ 1200 trials of Bob measuring 
j = 1 and j = 2, finding (K) = -1.265 ± .023 in the 
quantum game, corresponding to a ~11.3 a violation 
of the Leggett-Garg inequality under fair sampling as- 
sumptions, and ~ 7.8 a violation in a maximally ad- 
verse macrorealist position in which all undetermined 
measurements are assumed to represent Alice 'cheating' 
and are reassigned to minimise the quantum discrep- 
ancy from MR predictions (see Supplementary online 
information). 

Our results unite two concepts in foundational 
physics: Leggett-Garg inequalities and pre- and post- 
selected effectJi^ in a quantum system to which the 
Kochen-Specker no-go theorem applies^*^. Experimen- 
tal studies of the Legg ett-G arg inequality have previ- 
ously utilised ensemble d^ l ^ ^ l, or made assumptions re- 
garding process stationaritjI^Hini^ or have required weak 



measurement^^ to draw conclusions, whilst the exist- 
ing studies of the three box problem cannot incorpo- 
rate measurement non- detectabilitjJS^, presenting a 
loophole that allows classical non-contextual models to 
reproduce the quantum statisticJ^i'. We have studied 
the 'three box' experiment on a matter system, as origi- 
nally conceivecP' and developeti^in terms of sequential, 
projective non-demolition measurements, and we may 
therefore re-examine the conclusions that can be drawn 
when using this improved measurement capability. 

Two assumptions underpin macrorealism; 1: macro- 
scopic state dcfinitcness and 2: non-disturbing mea- 
surability. In previous studies it has been possible to 
assign violations of the Leggett-Garg inequality to a 
loss of non-disturbing measurability, in both opticaP^ 
and spin-basecP experiments. The disturbance due to 
measurement can sometimes be surprisingly non-locaP^ 
and it has been suggested that detectable disturbance 
is a necessary condi tion f or violating a Leggett-Garg 
inequality in all case d^^ * ^^ . Our results show explicitly 
that Alice cannot detect Bob's measurement (Fig. [4^), 
so that the measurement has no detectable disturbance, 
whilst the statistics violate a Leggett-Garg inequality. 
We are therefore able to rule out the macrorealist 's as- 
sumption 1: of state definiteness. 

Our experiment makes use of a three-level quantum 
system in which Bob's choice between Mi and M2 rep- 
resents a choice of measurement 'context' in the lan- 
guage of Kochen and Spekker^^^. If Bob is able to keep 
his measurement context secret, then a macrorealist Al- 
ice could only use a 'non-contextual' classical theory to 
describe the experiment. It is known that every pre- 
and post-selection paradox implies a Kochen-Specker 
proof of quantum contextualitjW. It has been argued 
that measurement disturbance provides a loophole to 
admit non-contextuality into classical models (in ad- 
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FIG. 3. Bob verifies tlie non-detectable nature 
of the Mj measurements, a) Measurement within the 
ms = — 1 manifold only: i) Bob's measurement results when 
observing the state |/) in the \j) basis are independent of the 
j value selected to within experimental error. Repeatability 
is illustrated by plotting the result of a second Mj measure- 
ment within ms = — 1, conditioned on (ii) the result -^M2, 
or (iii) the result M2. b) The repeatability of each Mj mea- 
surement is studied within the ms = — 1 manifold; a finite 
probability exists for the electron spin to branch into the 
ms = +1 manifold, yielding an undetermined reading, and 
for the nuclear spin to flip producing a 'definitely changes' 
outcome. Error bars show ±2 a / 95% confidence intervals. 



M, 
N 



Alice's Marginals 














1 ' 1 


1 



^ n^H M, Wins 

-f ^« M, Loses 

- %■ — Mj Wins 
s V M, Loses 




K, -1 



-1 



-1 +3 



+' Q2 ° 



FIG. 4. Violation of a Leggett-Garg inequality in 
the 'three-box' game, a) Alice's measurement M3 is in- 
dependent of Bob's choice to perform measurement Afi , M2 
or neither (A'^). b) The observations of Bob and Alice are 
correlated to indicate the probability that Bob has (or has 
not) seen state Mj given that Alice has seen M3, deter- 
mining who 'wins' the game. Alice's probability to 'win' 
exceeds 50% for both Bob's choices Mi and M2. c) Four 
MR-compatible histories that extremise (K) are illustrated 
by four trajectories passing through different boxes during 
the game. A trajectory entirely within the white j = 3 boxes 
has Q{i,2,3} ~ +1 and yields (K) — +3. Histories that visit 
other boxes yield (K) = — 1. d) The {K} values of the four 
paths are shown the corners of the (Q2,Q3) graph. Values 
for {Q2) and (Q3) from MR-compatible experiments must 
lie inside the shaded square satisfying — 1 < (K) < 3. Our 
measurement lies on the The value of (K) = —1.265 that 
we measured lies cyan curve outside this region. 



dition to finite measurement precisio iP^, all classi- 
cal models presented to date that exploit this loophole 
give rise to detectable measurement disturbances. In 
our experiment, Bob's intervening measurement intro- 
duces no disturbances detectable by Alice, and cannot 
be accounted for by existing classical models. 
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